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PRELIMINARY AMENDMENT 



BOX PATENT APPLICATION 

Assistant Commissioner for Patents September 13, 2000 

Washington, DC 20231 

Sir: 

The following Preliminary Amendments and Remarks are 
respectfully submitted in connection with the above-identified 
application . 



IN THE TITLE: 

Please amend the title to read: 

--A SIGNAL PROCESSING METHOD TO ANALYSE TRANSIENTS OF SPEECH 
SIGNALS-- 

IN THE SPECIFICATION: 

Please amend the specification as follows: 

Before line 1, insert — This application is the national 
phase under 35 U.S.C. § 371 of PCT International Application No. 
PCT/DK99/00128 which has an International filing date of March 
12, 1999, which designated the United States of America. ~ 



AMENDMENTS 



Docket No. 859-105P 



IN THE CLAIMS: 



Please amend the claims as follows: 

Claim 4: Line 1, change "any of the preceding claims" to 
--claim 1-- 

Claim 5: Line 1, change "any of the preceding claims" to 
--claim 1-- 

Claim 6: Line 1, change "any of the preceding claims" to 
--claim 1 — 

Claim 7: Line 3, change "any of the preceding claims" to 
--claim 1 — 

Claim 9: Line 3, change "any of the preceding claims" to 
--claim 1-- 

Claim 11: Line 4: change "any of claims 1-6" to 
— claim 1 — 

Claim 12: Line 2, change "any of claims 1-6" to 
--claim 1-- 

Claim 18: Line 1, change "any of claims 1-6" to 



— claim 1 — 



REMARKS 



The specification has been amended to provide a cross- 
reference to the previously filed International Application. The 
claims have also been amended to delete improper multiple 
dependents and to place the application into better form for 
examination. Entry of the present amendment and favorable action 
on the above-identified application are respectfully requested. 



2 



Docket No. 859-105P 



If necessary, the Commissioner is hereby authorized in this, 
concurrent, and future replies, to charge payment or credit any 
overpayment to Deposit Account No. 02-2448 for any additional 
fees required under 37 C.F.R. § 1.16 or under 37 C.F.R. § 1.17; 
particularly, extension of time fees. 



Respectfully submitted, 



BIRCH, STEWART, KOLASCH & BIRCH, LLP 



R<5£yrtiond C. "Stewart, #21,066 




RCS/cqc 
859-105P 



P.O. Box 747 

Falls Church, VA 22040-0747 
(703) 205-8000 
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A SIGNAL PROCESSING METHOD FOR DETERMINATION OF A PARAMETER OF A 
SYSTEM GENERATING THE SIGNAL 

The present invention relates to a method for determination of a 
5 parameter of a system generating a signal containing information 
about the parameter. 

The method may be used for identification of sound or speech 
signals, such as^ in speech recognition, or for quality measurement 
10 of audio products or systems, such as loudspeakers, hearing aids, 
telecommunication systems, or for quality measurement of acoustic 
conditions. The method of the present invention may also be used in 
connection with speech compression and decompression in narrow band 
telecommunication . 



The method may also be used in analysis of mechanical vibrations 
generated by a manufactured device during operation e.g. for 
detection of malfunction of the device. 

20 The method may further be used in electrobiology for example for 
analysis of neuroelectrical signals such as analysis of signals 
from an electroencephalograph, an electromyography etc. 

BACKGROUND OF THE INVENTION 

25 

The three documents 

HALIJAK C A et al.: "Simple Consequences of the Finite Time Laplace 
Transform Analysis of the Periodically Reversed Switched Capaci- 
30 tors", CIRCUITS, SYSTEMS, AND SIGNAL PROCESSING, 1985, USA, vol. 4, 
no. 4, pages 503-511, XP-002 105 4 4 6 , ISSN 0278-081X; 



15 



35 



BARRETT T W: "The Cochlea as Laplace Analyzer for Optimum 
(Elementary) Signals", ACUSTICA, Feb. 1978, WEST GERMANY, vol. 39, 
no. 3, pages 155-172, XP-002105 445 , ISSN 0001-7884; and 
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HARBOR R D et al . : "THE LAPLACE TRANSFORM", ENERGY AND INFORMATION 
TECHNOLOGIES IN THE SOUTHEAST, Columbia, April 9-12, 1989, vol. 1, 
9 April 1989, pages 376-379, XP-00007 68 24 , IEEE; 

5 offer relevant background art as regards the Laplace transform. 

Prior art methods of signal processing are based on a short time 
Fourier transform of signals and it is assumed that the signals are 
steady state signals. 

10 

In steady state analysis the signal is assumed stationary in the 
period the signal is analysed and the steady state spectrum is 
calculated , 

15 In real life steady state signals do not occur and steady state 

analysis does not provide sufficient knowledge of phenomena within 
various scientific and technological fields. Consider for example 
speech analysis. The human ear has the ability to simultaneously 
catch fast sound signals, detect sound frequencies with great 

20 accuracy and differentiate between sound signals in complicated 

sound environments. For instance it is possible to understand what 
a singer is singing in an accompaniment of musical instruments. 

It is assumed that the cochlea in the human ear can be regarded as 
25 comprising a large number of band-pass filters within the frequency 
range of the human ear. 

The time response f(t) for one band-pass filter due to an 
excitation can be separated into two components, the transient 
30 response, f t (t), and the steady state response, f £ (t), 
f (t)=f t (t)+f 5 (t) . 

Traditional signal processing is based on the steady state response 
f 5 (t), and the transient response f t (t) is assumed to vanish very 
35 fast and to be without importance for the perception, see for 

example "Principles of Circuit Synthesis", McGraw-Hill 1959, Ernest 
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5. Kuh and Donald 0, Pederson, page 12, lines 9-15, where it is 
stated that: 

"only the forced response is considered while the response due to 
5 the initial state of the network is ignored". 

Thus, when students are introduced to the world of signal analysis, 
they learn that the transient response, i.e. the response due to 
the initial state of the network should be ignored because it 
10 vanishes within a very short period of time. Furthermore, it is 
rather difficult to analyse these transient signals by use of 
traditional linear methods of analysis. 

The ability of the human ear to hear very short sounds and at the 
15 same time detect frequencies with great accuracy is in conflict 

with the traditional filterbased spectrum analysis. The time window 
(twice the rise time) of a band-pass filter is inversely 
proportional to the bandwidth, tw=2/ ( f u -f x ) , 

where f x is the lower cut-off frequency and f u is the upper cut-off 
20 frequency. 

Thus, if a rise time of 5 ms is required the consequence is that 
the frequency resolution is no better than 400 Hz. 

25 As the detection of these transients is in conflict with a high 
frequency resolution, the detecting by the human ear of these 
transients must take place in an alternative manner. It has not 
been examined how the human ear is able to detect these signals, 
but it might be possible that the cochlea, when no sounds are 

30 received, is in a position of rest, where the cochlea will be very 
broad-banded. When a sound signal is received, the cochlea may 
start to lock itself to the frequency component or components 
within the signal. Thus, the cochlea may be broad-banded in its 
starting position, but if one or more stable frequencies are 

35 received the cochlea may lock itself to this frequency or these 
frequencies with a high accuracy. 
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Today it is known that the nerve pulses launched from the cochlea 
are synchronized to the frequency of a tone if the frequency is 
less than about 1.4 kHz. If the frequency is higher than 1.4 kHz 
5 the pulses are launched randomly and less than once per cycle of 
the frequency. 

Signal processing based on filter bank spectrum analysis is 
disclosed in GB 2 213 623 , which describes a system for phoneme 

10 recognition. This system comprises detecting means for detecting 
transient parts of a voice signal, where the principal object of 
the transient detection is the detection of a point where the 
speech spectrum varies most sharply, namely, a peak point. The 
detection of the peak points is used for more precise phoneme 

15 segmentation. The transient analysis of GB 2213623 is based on a 
spectrum analysis and the change in the spectrum, which is very 
much different to the transient analysis of the present invention, 
which is based on a direct transient detection in the time domain. 

20 SUMMARY OF THE INVENTION 

The present invention provides an approach, which is different in 
principle from all known methods for processing signals. The 
approach taken and some of the results obtained will be explained 
25 by of an example in the context of analysis of speech signals. 

Speech is produced by means of short pulses generated by the vocal 
chords in the case of voiced speech and by friction in the vocal 
tract in the case of unvoiced speech. The pulses are filtered by 

30 the vocal tract that acts as a time-varying filter. The output 
response will consist of quasi steady state terms and also 
transient terms. The quasi steady state terms will only be damped 
slightly in the period before the next pulse is generated. The 
transient terms will be sufficiently damped in the time period 

35 before the next pulse is generated. 
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The speech signal is often assumed to have only quasi steady state 
terms in the period or time window of the analysis, typically 20-30 
ms . 

5 The placement of formants, the formants being energy bands in the 
short time power spectrum, are calculated by means of a short time 
spectrum analysis has previously been assumed decisive for speech 
intelligibility, together with voiced/unvoiced detection, the pitch 
and the quasi steady state power. 

10 

However, a number of observations, which has been performed within 
the field of auditory perception research, does not conform to the 
previous assumptions: 

15 Why is it possible to understand and identify a deep male voice 

through communication channels that have a higher cut-off frequency 
than the male pitch. 

The only difference between the pronunciation of the letters: e, b, 
20 d is in the first 1-3 ms of the voice signal and this information 
will be lost if the analysis have a time window of 20-30 ms . 

How can the absolute placement of these formants be decisive when 
their placement is quite different for different people, 
25 particularly between small children and large males. 

Why is distortion dominated by odd order harmonics and caused by 
cross-over distortion in a class B amplifier much more disturbing 
than distortion dominated by even order harmonics caused by 
30 amplitude distortion in a class A amplifier. 

The short time power spectrum will not distinguish frequencies from 
different sources, and tones generated by other sources than the 
speech signal will act like false formants. 

35 
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Why does a signal consisting of three tones with the same 
frequencies as the formants for a vowel not give the slightest 
perception of the vowel at all? The signal just sounds like three 
separate tones . 

5 

Why is the ear very sensitive to frequency changes of a signal up 
till about 1000 Hz, changes of +/- 3 Hz can be detected. For 
frequencies above 1000 Hz, the sensitivity is much smaller. 

10 The research performed by the present applicant leads to suggest 
that the ear is tone dominant until about 1.4 - 1.6 kHz and 
transient dominant above. Tone dominant means that the pulses 
launched from the hair cells as a response to a tone signal are 
synchronised to the tone signal. Transient dominant means, in the 

15 present context, that the hair cells are activated by changes of 
the energy with rise and fall times of at most 2 ms typical caused 
by transient pulses. 

Regarding speech signals, it is assumed that the quasi steady state 
20 terms are in the tone dominant interval of the ear and that the 
transient terms are in the transient dominant interval. It is 
believed that the transient terms are very important for speech 
intelligibility. The transient terms are seen as transient pulses 
in the speech signal. The rise time and the shape of leading and 
25 lagging edges of the envelope of transient pulses in the terms of a 
profile of damped frequencies describes the sound picture. The 
shape of the leading and lagging edges, the dynamic changes, change 
of amplitude, of the transient pulses, voiced/unvoiced detection 
and the changes of pitch are decisive for speech recognition. 

30 

This approach provides a number of advantages with respect to 
explaining the earlier mentioned speech perception observations. 



A natural explanation as to why it is possible to understand and 
35 identify a deep male voice through communication channels that have 
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a higher cut-off frequency than the male pitch is provided. The 
pitch can be detected as the period between transient pulses. 

The absolute placement of formants is not decisive. The damped 
5 frequencies profile of the shape of the transient pulse envelope is 
dominated by damped difference frequencies of the transient terms. 

Distortion caused by cross-over distortion in a class B amplifier 
generates abrupt energy changes (unwanted transients) which are 
10 much more disturbing than distortion caused by amplitude distortion 
in a class A amplifier which do not generate the same abrupt energy 
changes . 

Robust data- or telecommunication is based on modulation. The 
15 envelope of transient pulses is a kind of amplitude modulation, 
transient or impulse response modulation, and will have the same 
advantages * 

It is unlikely that frequencies from other sources will cause 
20 interference patterns with the speech signal that gives energy 
changes with time constants and shapes in the range that is 
decisive for speech intelligibility. This means that transient 
modulation will be robust in noisy environments and communication 
channels . 

25 

The ear is probably very sensitive to changes of a frequency up 
till about 1000 Hz because the nerve pulses are synchronised to the 
frequency and the period between the pulses is a measure for the 
frequency. In the high frequency range, where the pulses are not 
30 synchronised to the frequency, only placement of the frequency in 
the cochlea is a measure for the frequency. 

According to the invention it has for example been found that the 
signal information relevant to recognition of speech is present in 
35 a transient part of the speech signal. Thus, the method of the 

present invention may involve a separation of the transient part of 
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an auditory signal, a generation of a transient pulse corresponding 
to the transient part, and analysis of the shape of the pulse. In 
an auditory signal, the corresponding transient pulse may be 
repeated with time intervals f and the time interval of these 
5 periodic transient pulses is normally also analysed or determined. 

In real life, the human ear reacts to energy changes at high 
frequencies in order to recognise phonemes or sound pictures. But 
in the present method transient pulses corresponding to the energy 

10 changes observed by the ear are extracted at these high 

frequencies, wherefore the transient pulses preferably are 
transformed to the low frequency range still maintaining the 
distinct features of the sound pictures or phonemes. Thus, by using 
the principles of the invention, it is possible to obtain distinct 

15 features within auditory signals by examining the transformed low 
frequency signals. 

The invention relates to the use of the shape of energy changes of 
a signal for identifying or representing features of the system 
20 generating the signal for example in recognition of sound features 
which can be perceived by an animal ear such as a human ear as 
representing a distinct sound picture are determined. 

The method of the present invention provides an expression for the 
25 transient conditions of the auditory signal. The method comprises a 
band-pass filtration of an auditory signal within the frequency 
range of the human ear and a detection of a low-pass filtered 
envelope, which envelope then can be analysed with known methods of 
signal analysis. The envelope is an expression of the transient 
30 part of the signal. 

The method of signal analysis, which should be used when analysing 
the envelope, and the characteristics of the band-pass filter, 
which should be selected, will depend on the purpose of the 
35 analysis. The purpose may be speech recognition, quality- 



9 



measurement of audio products or acoustic conditions, and narrow 
band telecommunication . 

The invention also relates to a system for processing a signal to 
5 reduce the bandwidth of the signal with substantial retention of 
the information of the signal. The system may further comprise 
means for extracting the transient component of the auditory 
signal , and it may comprise means for detecting an envelope of the 
transient component. 

10 

A signal may be separated into a sum of impulse responses generated 
by poles and zeroes in the system that has generated the signal, if 
the time between the excitation pulses are sufficient long compared 
to the duration of the impulse responses for the system. 

15 

In WO 94/25958 it is shown that the envelope of the transient 
component in a speech signal is very important for its recognition 
and it is shown that the envelope of the impulse response will 
contain exponential functions and difference frequencies defined by 
20 the impulse response. 

A method based on damped sinus functions to extract important 
features from the envelope signal is described, and examples where 
the method is used on speech signals shows that the features are 
25 important in speech analysis. 

Before entering into a more detailed explanation of features of the 
method of the invention, a few definitions will be given: 

30 In short time analysis the transient component in a signal is a 
matter of definition. For auditory signals, the idea is to obtain 
an expression that gives a response corresponding to the response 
in the cochlea to an abrupt change in the signal energy. An abrupt 
change in the signal energy corresponds to the transient component 

35 in the auditory signal- Thus, in the present context, the term 
"transient component" designates any signal corresponding to an 
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abrupt energy change in an auditory signal. The transient component 
holds the signal information to be analysed and in order to analyse 
this information the transient component may be transformed to a 
corresponding transient pulse having a distinct shape. Thus, in the 
5 present context, the term "transient pulse" refers to a pulse 

having a distinct shape and substantially holding the information 
of the transient component of the auditory signal and thus 
corresponding to an abrupt change in the energy of the auditory 
signal. As mentioned above the transient part of a sound signal may 
10 be repeated with time intervals and thus, in the present context, 
the term "periodic" when used in combination with a transient 
component, response or pulse designates any transient component, 
response or pulse being repeated with intervals. 

15 The term "shape" designates any arbitrary time-varying function 
(which is time-limited or not time-limited) and which, within a 
given time interval T p has a distinctly different amplitude level 
in comparison with the amplitude level outside the interval. Thus, 
T p is the duration of the shape function when the shape function is 

20 time-limited, or the duration of the part of the function which has 
a distinctly different amplitude level in comparison with the 
amplitude level outside the time interval. 

In order to extract information from the shape of the energy 
25 changes, one broad aspect of the invention relates to represent the 
shape of the energy changes by the short time Laplace transform of 
a transient pulse of the signal. However, several methods can be 
applied in order to obtain a transient pulse corresponding to the 
change in energy, but it is preferred that an envelope detection is 
30 being used, where the envelope preferably should be detected from a 
transient response of the energy change in the auditory signal. 

The energy change representing the distinct sound picture can be a 
phoneme or vowel or any other sound which gives a sudden energy 
35 change in an auditory signal. 



11 



It is also an aspect of the invention to provide a method for 
identifying, in an auditory signal, energy changes which can be 
perceived by an animal ear such as a human ear as representing a 
distinct sound picture, the method comprising comparing the shape 
5 of energy changes of the signal with predetermined energy change 
shapes representing distinct sound pictures. For the identification 
it is preferred that the shape of the energy changes are 
represented by the shape of a transient pulse of the signal, and it 
is furthermore preferred that the shape of the transient pulse 
10 should be obtained by an envelope detection of a transient response 
of the energy change in the auditory signal. 

The invention also relates to a method for processing a signal so 
as to reduce the bandwidth of the signal with substantial retention 
15 of the information of the signal, comprising extracting a transient 
part of the signal. The method may further comprise detecting an 
envelope of the transient part of the signal. 

Known methods of processing signals are based on a short time 
20 Fourier transform of signals, and it is assumed that the signals 
are steady state signals. 

In steady state analysis the signal is assumed stable in the period 
the signal is analysed, and the steady state spectrum is 
25 calculated. 

In WO 94/25958 it is disclosed that transient pulses are important 
for speech coding and decoding in narrow band communication, for 
speech recognition and synthesis, and for sound quality in auditory 
30 products (i.e. loudspeakers, amplifiers and hearing aids) . 

An important part of a transient signal is the exponential 
functions or damping ratios or time constants. The damping ratio is 
the reason that the impulse response has a finite duration. The 
35 fact that the transient signal is important for auditory perception 
indicates that the response from the hair cells is dependent on the 
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time constants. If this is the case, it is possible that the 
damping ratios in the response from nerve cells in general are 
important for the human nerve system. 

5 Transient signals are also important in many other applications, 
among others signals generated by impacts from defects in rolling 
bearings and gearboxes. 

Based on the transient signal, it is possible to determine the 
10 natural time constants and frequencies in the system generating the 
signal. Further it is possible to determine the excitation pulses 
of the system. 



BRIEF DESCRIPTION OF THE DRAWINGS 



15 



Fig. 1 



shows a time-domain representation of a linear time- 



invariant system; 



20 



Fig. 2 



shows the impulse response of a Butterworth low-pass 
filter of 3. order and a cut-off frequency at 700 Hz, 



Fig. 3 



sho'ws the response with the filter relaxed for 
t< 0 and with a 4000 Hz tone as input at t>Q, 



25 Fig. 4 



shows the s-plane with poles and the zero for H(a,0)) , 



Fig. 5 



shows H(a,co) for co x and 0) 2 analysed parallel with the 



a axis, 



30 Fig. 6 



shows transient characteristics in speech signals, 



Figs. 7-12 show processed speech signals, 



Fig. 13 



shows a schematic of a filter bank according to the 



35 



present invention. 
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DETAILED DESCRIPTION OF THE DRAWINGS 

The importance of the transient part of a signal has been an 
overlooked phenomenon in signal analysis. 

5 

The response of a linear system to either an impulse or a step 
function is defined by its transient response properties. 

The relationship between the input and the output for the linear 
10 time-invariant system shown in Fig, 1 can be written as the 

convolution of the input signal and the impulse response of the 
system: 



If the system is initially relaxed and the input signal v?(t) is 
zero for t< 0 then the lower integration limit of Eq. (1) can be 
replaced with zero. Eq. (1) then shows the important role played by 
the impulse response in terms of the actual signal processing that 
20 is performed by the system. It states that the input signal is 

weighted or multiplied by the impulse response at every instant in 
time and, at any specific point in time, the output is the 
summation or integral of all past weighted inputs. 

25 The impulse response of a real system has a finite duration and the 
transient response has the same duration. Fig. 2 shows the impulse 
response of a Butterworth low-pass filter of 3. order and a cut-off 
frequency at 700 Hz. Fig. 3 shows the response with the filter 
relaxed for /< 0 and with a 4000 Hz tone as input at t > 0 . 



In many processes v*(7) will be a pulse with a short duration and 
v f (/) 0 before the next pulse will be generated. 




(1) 



-co 



15 



The Laplace transform of a signal v(t) is defined by 
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L(s) = jv(tye-"dt (2) 

0 

0 

5 

If v(/) is the impulse response h(t) for a system with 2 complex 
poles 

h{t) = e" Cffo+;£Uo)f + >' , / > 0 ( 3 ) 

10 

and 0 for/<0 and s ^ ~(a 0 ± j co 0 ) . 
The Laplace transform is 



15 H(s)~ S + a ° - 



(j + cr 0 +yo? 0 )(j + cr 0 -yfl> 0 ) 



or 



(cr + cr 0 + j(ce + fl> 0 ))(cr + <r 0 + y(<z> - tf> 0 )) 



(4) 



20 

From Eq. (4) it is seen that for (a,<») -» (-<7 0 ,±tf> 0 ) , fi?) — > ±oo . 

This is a well-known phenomenon and a logical consequence of this 
is as follows: 

25 

If the signal analysed is dominated by the impulse response of the 
system generating the signal, it is possible to determine the 
natural time constants and frequencies for the system. 



30 Fig. 5 shows a plot of H(a,0)) for CD-CQ X and CO -Q) 2 
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Analysing a signal along or parallel with the jco axis will give a 
frequency profile for a given a. 

5 Analysing a signal along or parallel with the a axis will give a 
time constant profile for a given jco. 

If a signal has a time constant profile with significant variations 
for specific frequencies, the signal is transient dominated. 
10 Opposite if the signal does not vary significantly for any 
frequency, the signal is steady state dominated. 

A short time Laplace transform is defined by: 



in which v L is the signal, L is the transformed signal, a is a time 
constant, and to is an angular frequency. 

20 It is not possible to calculate the short time Laplace transform in 
the same way as DFT in the discrete time domain because two 

arbitrary exponential functions, e aT and e bt , are not orthogonal 
with respect to each other. 

25 The short time Fourier analysis in the analogue time domain is 

based on a filter bank method. In this paper an equivalent method 
will be developed for the Laplace transform. 

From Eq. (1) and Eq. (3) : 
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(5) 



o 



30 
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t 

+ \vlt-X)e' ia - ,m ' )k dX (6) 
o 

v o (0 - V(a,co, 0 + V* ( a, 0), t) = u(t) + u(t) 

5 where u*(t) is the complex conjugate of u(t) and we have 

Re[L(o-,o?,/)] = iv o (/) (7) 

From Eq. (6) and Eq. (7) it is seen that filtering the signal v^t) by 
10 a filter with the impulse response h(a,0)j) with 2 complex poles 
will represent the reel part of the short time L(o^COj) transform. 

If we let v^t) be equal to the impulse response of a single pole we 
have 
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0 

t 

=z ke^^ j e ( ^ +ja " )A e Ha+J< " )A dA (8) 



k(e~^ +ja})i — e~ (ar ° +;t2;o ^) 
(<t-<t 0 ) + jX<d-o> 0 ) 

20 and from Eq. (7) we have 

2k(cr-a 0 )(e~ at cos{cot) - e~ aQt cos(o) 0 /)) 



2k(a> - © 0 )(e" or sin(<af) - e _<v sin(a> 0 0) 

4- ; (9a) 

(a-o- 0 )- + ((o-a> 0 Y 



or 



v o (t) ((c - a 0 ) cos(a> 0 t) - O - co 0 ) sinQ 0 /)) 

Zj = - 

2k O^o)" +(^-^>o)" 
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- e~ at ((a - cr 0 ) cos(a#) - (a> - a> 0 ) sin(&#)) 

; vi — ; Zz ( 9b > 

(cr-CT 0 ) +{CO~CO 0 ) 

Eq.(9) is not defined for (a,0)) = (a 0 ,O) 0 ) but from (8) we have in 
this case 

5 

t 

u{t)^ke'- {a ^ J6> ^\dX 

0 

= kte~ (<y ° +J<Vo)t 

10 and 

v o (t) = Ikte"^ cos(> 0 /) (10) 
and we have v o (/)— >0 for / — > oo . 

15 

Eq. (9) shows that the gain is inversely related to c~O" 0 and 

0) — Q) n , and when (cr o ,&> 0 ) is far from and e~ at — e~ ff ° f is small/ 

v o (0~0* For K^o)^^^) v o (0 will have Eq.(10) as the limit. It 
is not immediately to see if Eq. (9) has the maximum energy for 
20 (a n ,a> 0 )<-(cr,o>) . 

In the DC domain Eq. (9) can be written as 



25 v 0 (t) = 2k± (11) 

cr-a n 



The maximum for v o (J) can be found as follows 



^- = -i-[ OB --a J> «-'] = 0 

dt <j - a 0 L J 
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when 



f ^ log(^)-logK) 



and Eq. (11) will have the maximum for this value. 

5 

It can be shown that t m — > ~ when a — >■ <T 0 . 



When cr « cr 0 we will have the approximated maximum with ? = ~ 



10 v 0 (£)=2*^ 1 (13] 



From Eg. (13) it can be shown that 

Ike' 1 

V Q -» for cr — » cf q 

15 

In Eq.(ll) e~ CT ° f represent the signal to be analysed and e~ m the 
filter. Table 1 shows the result with a filter having 
a = 100 s~ x and the signal varying from 1 to 10000 s" 1 

20 It is not surprising that the convolution acts as a low-pass 

filter. The important fact is that the exponential function in the 
DC domain in some way acts as frequencies do in the frequency 
domain . 

25 In table 1 v ol (/ w ) is the result of a convolution where the signal 
is differentiated. The result is, as expected, a high-pass filter. 



If we look on Eq. (9a) without exponential functions it can be 
written as 

30 
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v 0 (0 



2k(sm(cot) - sin(co 0 t)) 



CO — COf, 



(14) 



it is seen that for co — > oo we will have v — > 0 . 



a : 


100 s" 1 








t 

m 




v„i(0 


s" 1 


S 






1 


0, 046516871 


0, 954548457 


0 f 009545485 


10 


0, 025584279 


0,774263683 


0, 077426368 


100 


0, 010000000 


0, 367879441 


0, 367879441 


1000 


0, 002558428 


0, 077426368 


0,774263683 


10000 


0, 000465169 


0, 009545485 


0, 954548457 



Table 1 v o (/ ffl )is given by Eq. (11, 12) and normalised 
by <j and 2k. v 0 j(^ m )is a convolution where the 
signal is differentiated and normalised by 2k. 



10 For CO « G> 0 we will have 



v = 



2k($in(cot) - sin(o) 0 t)) 



CD. 



(15) 
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It can be shown that for CO — > (O 0 we will have 

v o (t) — » 2ktcos(co () t) 
This result is as expected unstable. 



(16) 



20 In transient analysis only the beginning of the signal is of 

interest, and if CO 0 » I Eq.(14) will act as a band-pass filter. 



Speech processing is based on fast energy pulse generated by the 
vocal cords or by friction In the articulation channel weighted by 



the impulse response in the articulation channel. The rise time for 
the excitation pulses has to be sufficient faster than the rise 
time of the energy of the impulse response. 

5 The shape of energy pulses are important features in speech. If the 
time between the pulses is periodical it is voiced speech, and if 
not it is unvoiced speech. For some phonemes abrupt changes in the 
energy pulses are important, 

10 From WO 94/25958 it is known that the shape of the energy pulses 
are important for speech recognition, especially the leading edge. 
In the following a method to extract features will be developed 
based on an envelope detection. 

15 The convolution expressed in Eq. (9) can be regarded as a response 
from 2 poles in the articulation channel excited by an impulse. If 
o~ 0 ~ a we have from Eq. (9a) 




-of 



The envelope is defined as 



e{t) = Vw 2 (/) + n?(0 



25 where 



S{t) = u(t)* — 



is the Hilbert Transform. 



30 The envelope of Eq.(17) is then 
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1^2(1 -cos(^? - <s> 0 )0 



\co - 0) 0 j 



= r^— it 1 " tcos((^ - © 0 )o) < 18 ) 

\CD-Q) 0 \ 

5 

The approximation is acceptable because |cos((<2> - a> 0 )t)\ < 1 

As expected the envelope has a component with the difference 
frequency of the 2 frequencies. 

10 

The conclusion is that we can expect to find damped difference 
frequencies in the envelope of the transient component. 

To detect the damped difference frequencies a filter bank is used. 
15 The features might be detected as a convolution between the 
transient pulse and the impulse response of the filters. 

In general form the impulse response can be written as 
20 h(t) = Ice-** sm{f{X)t + <f>) 

Where o-X and CO-f(X), 

In the following analysis /(A) = 15/1, k = w = \5X , and ^=0 are 
25 selected and we have 

h(t) = \5Xe~ M $m{\5Xt) ( 19 > 

By selecting CO = 1.5a Eq.(19) will act as a band-pass filter with a 
30 low Q in relation to the frequencies. Other ratios co/a than 1.5 may 
be selected and it is presently preferred that the ratio (©/a) 
ranges from 0.5 to 2.5. The exponential function gives the advance 



22 

that it acts like natural time window that ensure that the signal 
is natural damped. The value of the parameters are selected by 
studying rise times in important transient pulses and by 
experiments . 

5 

Fig. 6 shows transient characteristics in speech signals. The top 
figure shows 50 ms of an 'a" in "hard key" pronounced by a female. 

The second signal is a band-pass filtration of the speech signal. 
10 The band-pass filter is a Butterworth filter with 6 poles and a 
band width from 2150 to 3550 Hz. This frequency band contains 
important transient pulses in the sensitive frequency interval of 
the ear. 

15 The third signal is a energy detection of the transient 

characteristics of the band-pass filtered speech signal. The 
detection is an envelope detection performed by means of a 
rectification and a low-pass filtration of the signal. The filter 
is a Butterworth filter with 3 poles and a cut-off frequency at 70C 

20 Hz. 

In WO 97/09712 a method for automatically detecting the leading 
edges is disclosed. The method uses the maximum slope of the 
leading edge as reference, and the point before the maximum slope 
25 where the slope is less than a given threshold (10-20 % of the 
maximum slope) the leading edge is defined to begin. 

The transient (envelope) signal in Fig. (6) has a DC component, 
which does not contain any information. Therefore it is preferred 
30 that the signal is differentiated before it is analysed e.g. by tb 
filter bank shown in Fig. 13. 

In Fig. 13, the filters (h x (t), h 2 (t),..., h n (t)) in the filter bank 
connected between the input and the envelope detectors are band- 
35 pass filters having bandwidths corresponding to the bandwidths of 
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the band-pass filters of the cochlea and having centre frequencies 
ranging from 1400 Hz to 6500 Hz. 

The output signals o^ip) from the filter bank shown in Fig. 13 is 
5 calculated by: 

\ (p) = 1.5 V"* sin(A M p) , i=o , l , n-1 

j=0,l,...,M-l 

10 h 9 (p) = 0, P < o 

°v O) = & ip-k), p=o , i , p-i 

m=0, 1, ...,M-1 and M is the number of band-pass filters with a low Q 
in the filter bank connected between the outputs and the envelope 
15 detectors, p = 0,1,.../ P-l is the sample number, t' is the 
differentiated transient signal, and X m is the filter bank 
parameter and it is normalised by the sampling frequency. 

In the analysis M is selected to 10 and 1500 < k x m < 12000 s" 1 , X m is 
20 not normalised. By this we have 1885 < <2> m < 18850 s" 1 or 
300 <f <3000 Hz. 

This filtering process is not done in the cochlea but in the hair 
cells or in the nerve system behind the hair cells. 

25 

The Figs. 7, 8, 9, 10, 11, and 12 show the output of the processing 
of transient signals in the vowels v a" , " o" , " i" in "hard key" and 
"soft key" pronounced by a female and a male. Further the figures 
show plots of maxima of the output signals as a function of the 
30 time constant a of the corresponding filter. 

The figures show that maximum curves are very much alike for the 
same vowels, independent of whether a female or male pronounces it. 
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With a library of templates and a distance measure it is possible 
to identify the sound picture, and it can be used for speech 
recognition and narrow band communication. 

Thus, according to the invention a method and an apparatus are 
provided for determination of a parameter of a system generating a 
signal containing information about the parameter, in which the 
signal is short time transformed substantially in accordance with 



10 



in which v x is the signal, L is the transformed signal, a is a time 
constant, co is an angular frequency, and cp is a phase, or, in 
accordance with another transformation which will give rise to an 
15 L' (a,o,t) which in time intervals within which L(a,co,t) is larger 
than 10% of its maximum value is not more than 50% different from 
the result given by the short time Laplace transformation. 

In narrow band communication the transient pulses have to be 
20 identified and coded, and the decoder will contain a library of 

filters with corresponding transient responses. The decoder library 
could also contain the transient responses. 

The present invention also relates to measurement of mechanical 
25 vibrations e.g. when testing devices that generate mechanical 
energy during operation, such as mechanical devices with moving 
parts, such as compressors for refrigerators, electric motors, 
household machines, electric razors, combustion engines, etc, etc. 

30 For example, it is known that measurement of vibration generated or 
sound emitted by a device during operation can be useful for 
detection of malfunction of the device. Certain failures may 
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generate sound or vibration of specific characteristics that can be 
recognised . 

The method may also comprise steps of classification for 
5 classifying a tested device in accordance with the determined 
parameters into one class of a set of predefined classes. Each 
predefined class may be defined by a set of upper and lower limits 
for specific parameters determined according to the method. A 
device may then be classified as belonging to a certain class if 
10 its corresponding parameter values lie within corresponding upper 
and lower limits of the class. 

Each class may correspond to a specific type of failure of the 
device. For example, shaft imbalance, wheel imbalance, crookedness, 

15 imperfections of teeth in cogs, tight bearing, loose bearings, etc, 
may cause the device to vibrate in different characteristic ways, 
whereby a characteristic mechanical vibration or sound is generated 
for each type of failure. The type of failure of the device may 
then be detected by comparing determined device parameters with 

20 corresponding parameter values of various predetermined classes. 

The upper and lower limits of a specific class of devices may be 
determined by testing a set of devices known to belong to that 
class. For example, the upper limits may be determined as the 
25 average of specific parameter values plus three times the standard 
deviation. Likewise, the lower limits may be determined as the 
average of parameter values minus three times the standard 
deviation . 
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CLAIMS 



1. A method for determination of a parameter of a system generating 
a signal containing information about the parameter, comprising the 
5 step of short time transforming the signal substantially in 
accordance with 



in which v x is the signal, L is the transformed signal, cy is a time 
10 constant, co is an angular frequency, and cp is a phase. 

2. A method according to claim 1, wherein the step of transforming 
comprises filtering the signal v ± with a filter having a pole at a 
+ jot and a pole at a - jot. 



3. A method according to claim 1 or 2, comprising steps of 
transforming the signal v x for a plurality of sets of a and co 
values . 

20 4 . A method according to any of the preceding claims, further 
comprising the step of determining a maximum of at least one 
transformed signal L(a,co,t). 

5. A method according to any of the preceding claims, further 
25 comprising the step of comparing transformed signals L with 

corresponding reference signals in order to determine parameters of 
the system. 

6. A method according to any of the preceding claims, further 

30 comprising a step of pre-processing the signal before the step of 
short time transforming, the pre-processing being selected from the 




o 



15 
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group consisting of filtering, rectification, differentiation, 
integration, and amplification. 

7. A method of transmitting a signal containing information of a 
5 set of parameters of a system generating the signal, comprising 
processing the signal according to any of the preceding claims and 
further comprising the step of transmitting the determined 
parameter values. 

10 8. A method according to claim 7 further comprising the step of 
generating a copy of the signal from the transmitted parameter 
values . 

9. A method of transmitting a signal containing information of a 
15 set of parameters of a system generating the signal, comprising 

processing the signal according to any of the preceding claims and 
further comprising the steps of 

comparing the signal with a library of signals generated for a 
20 predetermined set of parameter values by the system, 

selecting the library function that constitutes the best match to 
the signal, and 

25 transmitting an identification signal that identifies the matching 
library function. 

10. A method according to claim 9, further comprising the steps of 
receiving the identification signal and generating the 

30 corresponding library signal. 

11. A method of classifying a system according to one or more 
parameters of the system generating a signal containing information 
about the one or more parameters, comprising determining the one or 

35 more parameters according to any of claims 1-6 and further 

comprising the step of classifying the system in accordance with 



28 



the one or more determined parameters into one class of a set of 
predefined classes defined by predetermined ranges of values of the 
parameters . 



5 12. A method for communicating an auditory signal, comprising 

processing the signal by the method according to any of claims 1-6, 
transmitting the processed signal, and receiving the processed 
signal by a receiver. 



10 13. A method according to claim 12, wherein, prior to transmission 
of the processed signal, the signal is coded into a digital 
representation, and the coded signal is decoded in the receiver so 
as to reestablish transient pulse shapes perceived by an animal ear 
such as a human ear as representing the distinct sound pictures of 

15 the auditory signal. 

14. A method according to 
transmission is performed 
per second. 

20 

15. A method according to claim 14, wherein the bandwidth is at the 
most 2000 bits per second. 



claim 13, wherein the digital 

at a bandwidth of at the most 4000 bits 



16. A method according to claim 15, wherein the bandwidth is in the 
25 interval of 800-2000 bits per second. 

17. A method according to any of claims 13-16, wherein a second and 
further pulses in a sequence of identical pulses are represented by 
a digital value indicating repetition. 

30 

18. A method according to any of claims 1-6, comprising filtering 
the signal v x in a filter bank comprising a plurality of band-pass 
filters interconnected in parallel with centre frequencies ranging 
from 1400 Hz to 6500 Hz, each of which is connected in series with 

35 an envelope detector and a filter bank comprising a plurality of 
low-pass filters interconnected in parallel and having cut-off 
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frequencies ranging from 300 Hz to 3000 Hz and time constants a 
ranging from 1500 s" 1 to 12000 s" 1 . 

19. An apparatus for determination of a parameter of a system 

5 generating a signal containing information about the parameter, 
comprising a processor that is adapted to short time transform the 
signal substantially in accordance with 

r 

0 

10 in which v 1 is the signal, L is the transformed signal, a is a time 
constant, to is an angular frequency, and cp is a phase. 

20. An apparatus according to claim 19, wherein the processor 
comprises a filter for filtering the signal v 2 and having a pole at 

15 a + jcot and a pole at a - jcot. 

21. An apparatus according to claim 19 or 20, wherein the processor 
comprises a plurality of filters for filtering the signal v ir each 
filter having a different set of a and co values. 

20 

22. An apparatus according to claim 19, wherein the apparatus 
comprises a communication channel transmitter, and the processor is 
adapted to determine the one or several parameters of the system, 
and 

25 

to transmit the one or several system parameters over a wireless or 
a cable communication channel. 
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ABSTRACT 

The present invention is related to a method and an apparatus for 
determination of a parameter of a system generating a signal 
5 containing information about the parameter. The method comprises 
the step of short time Laplace transforming the signal and may be 
utilised for classifying the system in question in accordance with 
one or more determined parameters into one class of a set of 
predefined classes defined by predetermined ranges of values of the 

10 parameters. The invention also relates to the use of a shape of 

energy changes of a signal for identifying or representing features 
of the system generating the signal. This use may be applied to 
recognition of sound features perceivable by e.g. a human ear as 
representing a distinct sound picture. It has for example been 

15 found that the signal information relevant to recognition of speech 
is present in a transient part of the speech signal. 
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Speech signal 




Transient isolation in the speech signal, band width 21 50-3550 Hz 




Energy detection of the transient pulses by means of envelop detection, 
rectified and low pass filtered at 700 Hz 
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The present invention relates to a method for determination of a 
5 parameter of a system generating a signal containing information 
about the parameter. 

The method may be used for identification of sound or speech 
signals, such as in speech recognition, or for quality measurement 
10 of audio products or systems, such as loudspeakers, hearing aids, 
telecommunication systems, or for quality measurement of acoustic 
conditions. The method of the present invention may also be used in 
connection with speech compression and decompression in narrow band 
telecommunication . 

15 

The method may also be used in analysis of mechanical vibrations 
generated by a manufactured device during operation e.g. for 
detection of malfunction of the device. 

20 The method may further be used in electrobiology for example for 
analysis of neuroelectrical signals such as analysis of signals 
from an electroencephalograph, an elect romyograph, etc. 

BACKGROUND OF THE INVENTION 

25 

Prior art methods of signal processing are based on a short time 
Fourier transform of signals and it is assumed that the signals are 
steady state signals. 

30 In steady state analysis the signal is assumed stationary in the 
period the signal is analysed and the steady state spectrum is 
calculated. 

In real life steady state signals do not occur and steady state 
35 analysis does not provide sufficient knowledge of phenomena within 
various scientific and technological fields. Consider for example 
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speech analysis. The human ear has the ability to simultaneously 
catch fast sound signals, detect sound frequencies with great 
accuracy and differentiate between sound signals in complicated 
sound environments. For instance it is possible to understand what 
5 a singer is singing in an accompaniment of musical instruments. 

It is assumed that the cochlea in the human ear can be regarded as 
comprising a large number of band-pass filters within the frequency 
range of the human ear . 

10 

The time response f (t) for one band-pass filter due to an 
excitation can be separated into two components, the transient 
response, f c (t), and the steady state response, f s (t), 
f (t)=f t (t)+f.(t) . 

15 

Traditional signal processing is based on the steady state response 
f«(t), and the transient response f t (t) is assumed to vanish very- 
fast and to be without importance for the perception, see for 
example "Principles of Circuit Synthesis", McGraw-Hill 1959, Ernest 
20 5. Kuh and Donald O. Pederson, page 12, lines 9-15, where it is 
stated that : 

"only the forced response is considered while the response due to 
the initial state of the network is ignored". 

25 

Thus, when students are introduced to the world of signal analysis, 
they learn that the transient response, i.e. the response due to 
the initial state of the network should be ignored because it 
vanishes within a very short period of time. Furthermore, it is 
30 rather difficult to analyse these transient signals by use of 
traditional linear methods of analysis. 

The ability of the human ear to hear very short sounds and at the 
same time detect frequencies with great accuracy is in conflict 
35 with the traditional filterbased spectrum analysis. The time window 
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{twice the rise time) of a band-pass filter is inversely 
proportional to the bandwidth, tw=2/ ( f u - f x ) , 

where f x is the lower cut-off frequency and f u is the upper cut-off 
frequency. 

5 

Thus, if a rise time of 5 ms is required the consequence is that 
the frequency resolution is no better than 400 Hz. 

As the detection of these transients is in conflict with a high 
10 frequency resolution, the detecting by the human ear of these 
transients must take place in an alternative manner. It has not 
been examined how the human ear is able to detect these signals, 
but it might be possible that the cochlea, when no sounds are 
received, is in a position of rest, where the cochlea will be very 
15 broad-banded. When a sound signal is received, the cochlea may 
start to lock itself to the frequency component or components 
within the signal. Thus, the cochlea may be broad-banded in its 
starting position, but if one or more stable frequencies are 
received the cochlea may lock itself to this frequency or these 
20 frequencies with a high accuracy. 

Today it is known that the nerve pulses launched from the cochlea 
are synchronized to the frequency of a tone if the frequency is 
less than about 1.4 kHz. If the frequency is higher than 1.4 kHz 
25 the pulses are launched randomly and less than once per cycle of 
the frequency . 

Signal processing based on filter bank spectrum analysis is 
disclosed in GB 2213623 which describes a system for phoneme 

30 recognition. This system comprises detecting means for detecting 
transient parts of a voice signal, where the principal object of 
the transient detection is the detection of a point where the 
speech spectrum varies most sharply, namely, a peak point. The 
detection of the peak points is used for more precise phoneme 

35 segmentation. The transient analysis of GB 2213623 is based on a 
spectrum analysis and the change m the spectrum, which is very 
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much different to the transient analysis of the present invention 
which is based on a direct transient detection in the time domain. 
SUMMARY OF THE INVENTION 



5 The present invention provides an approach which is different in 
principle from all known methods for processing signals. The 
approach taken and some of the results obtained will be explained 
by of an example in the context of analysis of speech signals. 

10 Speech is produced by means of short pulses generated by the vocal 
chords in the case of voiced speech and by friction in the vocal 
tract in the case of unvoiced speech. The pulses are filtered by 
the vocal tract that acts as a time-varying filter. The output 
response will consist of quasi steady state terms and also 

15 transient terms. The quasi steady state terms will only be damped 
slightly in the period before the next pulse is generated. The 
transient terms will be sufficiently damped in the time period 
before the next pulse is generated. 

20 The speech signal is often assumed to have only quasi steady state 
terms in the period or time window of the analysis, typically 20-30 
ms . 



The placement of formants, the formants being energy bands in the 
25 short time power spectrum, are calculated by means of a short time 
spectrum analysis has previously been assumed decisive for speech 
intelligibility, together with voiced/unvoiced detection, the pitch 
and the quasi steady state power. 

30 However, a number of observations, which has been performed within 
the field of auditory perception research, does not conform to the 
previous assumptions : 

Why is it possible to understand and identify a deep male voice 
35 through communication channels that have a higher cut-off frequency 
than the male pitch. 
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The only difference between the pronunciation of the letters: e, b, 
d is in the first 1-3 ms of the voice signal and this information 
will be lost if the analysis have a time window of 20-30 ms . 

5 

How can the absolute placement of these formants be decisive when 
their placement is quite different for different people, 
particularly between small children and large males. 

10 Why is distortion dominated by odd order harmonics and caused by 
cross-over distortion in a class B amplifier much more disturbing 
than distortion dominated by even order harmonics caused by 
amplitude distortion in a class A amplifier. 

15 The short time power spectrum will not distinguish frequencies from 
different sources, and tones generated by other sources than the 
speech signal will act like false formants. 

Why does a signal consisting of three tones with the same 
20 frequencies as the formants for a vowel not give the slightest 
perception of the vowel at all? The signal just sounds like three 
separate tones . 

Why is the ear very sensitive to frequency changes of a signal up 
25 till about 1000 Hz, changes of + /- 3 Hz can be detected. For 
frequencies above 1000 Hz, the sensitivity is much smaller. 

The research performed by the present applicant leads to suggest 
that the ear is tone dominant until about 1.4 - 1.6 kHz and 

30 transient dominant above. Tone dominant means that the pulses 
launched from the hair cells as a response to a tone signal are 
synchronised to the tone signal. Transient dominant means, in the 
present context, that the hair cells are activated by changes of 
the energy with rise and fall times of at most 2 ms typical caused 

35 by transient pulses . 
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Regarding speech signals, it is assumed that the quasi steady state 
terms are in the tone dominant interval of the ear and that the 
transient terms are in the transient dominant interval . It is 
believed that the transient terms are very important for speech 
5 intelligibility. The transient terms are seen as transient pulses 
in the speech signal . The rise time and the shape of leading and 
lagging edges of the envelope of transient pulses in the terms of a 
profile of damped frequencies describes the sound picture. The 
shape of the leading and lagging edges, the dynamic changes, change 
10 of amplitude, of the transient pulses, voiced/unvoiced detection 
and the changes of pitch are decisive for speech recognition. 

This approach provides a number of advantages with respect to 
explaining the earlier mentioned speech perception observations. 

15 

A natural explanation as to why it is possible to understand and 
identify a deep male voice through communication channels that have 
a higher cut-off frequency than the male pitch is provided. The 
pitch can be detected as the period between transient pulses. 

20 

The absolute placement of formants is not decisive. The damped 
frequencies profile of the shape of the transient pulse envelope is 
dominated by damped difference frequencies of the transient terms. 

25 Distortion caused by cross -over distortion in a class B amplifier 
generates abrupt energy changes {unwanted transients) which are 
much more disturbing than distortion caused by amplitude distortion 
in a class A amplifier which do not generate the same abrupt energy 
changes . 



30 



Robust data- or telecommunication is based on modulation. The 
envelope of transient pulses is a kind of amplitude modulation, 
transient or impulse response modulation, and will have the same 
advantages . 



35 
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It is unlikely that frequencies from otner sources will cause 
interference patterns with the speech signal that gives energy 
changes with time constants and shapes m the range that is 
decisive for speech intelligibility. This means that transient 
5 modulation will be robust in noisy environments and communication 
channels . 

The ear is probably very sensitive to changes of a frequency up 
till about 10 0 0 Hz because the nerve pulses are synchronised to the 
10 frequency and the period between the pulses is a measure for the 
frequency. In the high frequency range, where the pulses are not 
synchronised to the frequency, only placement of the frequency in 
the cochlea is a measure for the frequency. 

15 According to the invention it has for example been found that the 
signal information relevant to recognition of speech is present in 
a transient part of the speech signal. Thus, the method of the 
present invention may involve a separation of the transient part of 
an auditory signal, a generation of a transient pulse corresponding 

20 to the transient part, and analysis of the shape of the pulse. In 
an auditory signal, the corresponding transient pulse may be 
repeated with time intervals, and the time interval of these 
periodic transient pulses is normally also analysed or determined. 

25 In real life, the human ear reacts to energy changes at high 

frequencies in order to recognise phonemes or sound pictures. But 
in the present method transient pulses corresponding to the energy 
changes observed by the ear are extracted at these high 
frequencies, wherefore the transient pulses preferably are 

30 transformed to the low frequency range still maintaining the 

distinct features of the sound pictures or phonemes. Thus, by using 
the principles of the invention, it is possible to obtain distinct 
features within auditory signals by examining the transformed low 
frequency signals. 
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The invention relates to the use of the shape of energy changes of 
a signal for identifying or representing features of the system 
generating the signal for example in recognition of sound features 
which can be perceived by an animal ear such as a human ear as 
5 representing a distinct sound picture are determined. 

The method of the present invention provides an expression for the 
transient conditions of the auditory signal. The method comprises a 
band-pass filtration of an auditory signal within the frequency 
10 range of the human ear and a detection of a low-pass filtered 

envelope, which envelope then can be analysed with known methods of 
signal analysis. The envelope is an expression of the transient 
part of the signal. 

15 The method of signal analysis, which should be used when analysing 
the envelope, and the characteristics of the band-pass filter, 
which should be selected, will depend on the purpose of the 
analysis. The purpose may be speech recognition, quality- 
measurement of audio products or acoustic conditions, and narrow 

20 band telecommunication. 

The invention also relates to a system for processing a signal to 
reduce the bandwidth of the signal with substantial retention of 
the information of the signal. The system may further comprise 
25 means for extracting the transient component of the auditory 

signal, and it may comprise means for detecting an envelope of the 
transient component . 

A signal may be separated into a sum of impulse responses generated 
30 by poles and zeroes in the system that has generated the signal, if 
the time between the excitation pulses are sufficient long compared 
to the duration of the impulse responses for the system. 

In WO 94/25958 it is shown that the envelope of the transient 
35 component in a speech signal is very important for its recognition 
and it is shown that the envelope of the impulse response will 
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contain exponential functions and difference frequencies defined by 
the impulse response . 

A method based on damped sinus functions to extract important 
5 features from the envelope signal is described, and examples where 
the method is used on speech signals shows that the features are 
important in speech analysis. 

Before entering into a more detailed explanation of features of the 
10 method of the invention, a few definitions will be given: 

In short time analysis the transient component in a signal is a 
matter of definition. For auditory signals, the idea is to obtain 
an expression that gives a response corresponding to the response 

15 in the cochlea to an abrupt change in the signal energy. An abrupt 
change in the signal energy corresponds to the transient component 
in the auditory signal. Thus, in the present context, the term 
"transient component" designates any signal corresponding to an 
abrupt energy change in an auditory signal. The transient component 

20 holds the signal information to be analysed and in order to analyse 
this information the transient component may be transformed to a 
corresponding transient pulse having a distinct: shape. Thus, in the 
present context, the term "transient pulse" refers to a pulse 
having a distinct shape and substantially holding the information 

25 of the transient component of the auditory signal and thus 

corresponding to an abrupt change in the energy of the auditory 
signal. As mentioned above the transient part of a sound signal may 
be repeated with time intervals and thus, in the present context, 
the term "periodic" when used in combination with a transient 

30 component, response or pulse designates any transient component, 
response or pulse being repeated with intervals. 

The term "shape" designates any arbitrary time-varying function 
(which is time-limited or not time-limited) and which, within a 
35 given time interval T r has a distinctly different amplitude level 
in comparison with the amplitude level outside the interval. Thus, 
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T p is the duration of the shape function when the shape function is 
time- limited, or the duration of the part of the function which has 
a distinctly different amplitude level in comparison with the 
amplitude level outside the time interval. 

5 

In order to extract information from the shape of the energy 
changes, one broad aspect of the invention relates to represent the 
shape of the energy changes by the short time Laplace transform of 
a transient pulse of the signal. However, several methods can be 
10 applied in order to obtain a transient pulse corresponding to the 
change in energy, but it is preferred that an envelope detection is 
being used, where the envelope preferably should be detected from a 
transient response of the energy change in the auditory signal. 

15 The energy change representing the distinct sound picture can be a 
phoneme or vowel or any other sound which gives a sudden energy 
change in an auditory signal. 

It is also an aspect of the invention to provide a method for 
20 identifying, in an auditory signal, energy changes which can be 
perceived by an animal ear such as a human ear as representing a 
distinct sound picture, the method comprising comparing the shape 
of energy changes of the signal with predetermined energy change 
shapes representing distinct sound pictures. For the identification 
25 it is preferred that the shape of the energy changes are 

represented by the shape of a transient pulse of the signal, and it 
is furthermore preferred that the shape of the transient pulse 
should be obtained by an envelope detection of a transient response 
of the energy change in the auditory signal. 

30 

The invention also relates to a method for processing a signal so 
as to reduce the bandwidth of the signal with substantial retention 
of the information of the signal, comprising extracting a transient 
part of the signal. The method may further comprise detecting an 
35 envelope of the cransient part of the signal. 
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Known methods of processing signals are based on a snort time 
Fourier transform of signals, and it is assumed that the signals 
are steady state signals. 

5 In steady state analysis the signal is assumed stable in the period 
the signal is analysed, and the steady state spectrum is 
calculated. 

In WO 94/25958 it is disclosed that transient pulses are important 
10 for speech coding and decoding in narrow band communication, for 
speech recognition and synthesis, and for sound quality in auditory 
products (i.e. loudspeakers, amplifiers and hearing aids). 

An important part of a transient signal is the exponential 
15 functions or damping ratios or time constants. The damping ratio is 
the reason that the impulse response has a finite duration. The 
fact that the transient signal is important for auditory perception 
indicates that the response from the hair cells is dependent on the 
time constants. If this is the case, it is possible that the 
20 damping ratios in the response from nerve cells in general are 
important for the human nerve system. 

Transient signals are also important in many other applications, 
among others signals generated by impacts from defects in rolling 
25 bearings and gear-boxes. 

Based on the transient signal, it is possible to determine the 
natural time constants and frequencies in the system generating the 
signal. Further it is possible to determine the excitation pulses 
30 of the system. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows a time-domain representation of a linear time- 

35 invariant system, 
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Fig. 2 shows the impulse response of a Butterworth low-pass 

filter of 3. order and a cut-off frequency at 700 Hz, 

Fig. 3 shows the response with the filter relaxed for 

5 t< 0 and with a 4000 Hz tone as input at />0, 

Fig. 4 shows the s -plane with poles and the zero for //(c\co) , 

Fig. 5 shows //(a,co) for co t and co : analysed parallel with the 

10 axis, 

Fig. 6 shows transient characteristics in speech signals, 

Figs. 7-12 show processed speech signals, 

15 

Fig. 13 shows a schematic of a filter bank according to the 
present invent ion . 

DETAILED DESCRIPTION OF THE DRAWING 

20 

The importance of the transient part of a signal has been an 
overlooked phenomenon in signal analysis. 

The response of a linear system to either an impulse or a step 
25 function is defined by its transient response properties. 

The relationship between the input and the output for the linear 
time- invariant system shown in Fig. 1 can be written as the 
convolution of the input signal and the impulse response of the 
30 system: 



v „(/) = $v,(x)h(! - x)dx (1) 
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If the system is initially relaxed and the input signal v,(/) is 
zero for /< 0 then the lower integration limit of Eq. (1) can be 
replaced with zero. Eq. (l) then shows the important role played by 
the impulse response in terms of the actual signal processing that 
5 is performed by the system. It states that the input signal is 

weighted or multiplied by the impulse response at every instant in 
time and, at any specific point in time, the output is the 
summation or integral of all past weighted inputs. 

10 The impulse response of a real system has a finite duration and the 
transient response has the same duration. Fig. 2 shows the impulse 
response of a Butterworth low-pass filter of 3. order and a cut-off 
frequency at 700 Hz. Fig. 3 shows the response with the filter 
relaxed for t< 0 and with a 4000 Hz tone as input at />0. 

15 

In many processes v*(7) will be => pulse with a short duration and 
Vi(/) ^ 0 before the next pulse will be generated. 

The Laplace transform of a signal v(/) is defined by 

20 

CO 

L(s)= |v(0e~Vr (2) 

o 

CO 

0 

25 If v(0 is the impulse response h{t) for a system with 2 complex 
poles 

h^^e^^'^e'^^ , />0 (3) 

30 and 0 for / < 0 and s * -(c 0 ± ;co 0 ) . 
the Laplace transform is 
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H(s) = - °~ 



(s -kt 0 +yco 0 )(5 +a 0 ->a> 0 ) 
or 

5 



a -kt 0 + /CO 

//(a.a>) = - -J-^ (4) 

(a +a 0 +y(co -kd 0 ))(a +a 0 +y (a> -co 0 )) 



From Eq. (4) it is seen that for (a,©) — > (—a 0 ,±co 0 ) , H(g,(£>) -» ±co . 

10 

This is a well-known phenomenon and a logical consequence of this 
is as follows: 

If the signal analysed is dominated by the impulse response of the 
15 system generating the signal, it is possible to determine the 
natural time constants and frequencies for the system. 

Fig. 5 shows a plot of H(ct,cd) for co =co , and co =o> 2 . 

20 Analysing a signal along or parallel with the 303 axis will give a 
frequency profile for a given g. 



Analysing a signal along or parallel with the a axis will give a 
time constant profile for a given jco. 

25 

If a signal has a time constant profile with significant variations 
for specific frequencies, the signal is transient dominated. 
Opposite if the signal does not vary significantly for any 
frequency, the signal is steady state dominated. 

30 

A short time Laplace transform is defined by: 
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f 

1(0,©./)= \v t {t-\)e- { « + l{ * YK d}. (5) 



in which is the signal, L» is the transformed signal, a is a time 
constant, and to is an angular frequency. 

It is not possible to calculate the short time Laplace transform in 
the same way as DFT in the discrete time domain because two 

arbitrary exponential functions, e'" and e ht , are not orthogonal 
with respect to each other. 

The short time Fourier analysis in the analogue time domain is 
based on a filter bank method. In this paper an equivalent method 
will be developed for the Laplace transform. 



15 

From Eq. (1) and Eq. (3) : 



Vo (0= Jv;(/-X)e-* a+/w) ^ 

0 

20 

/ 

+ jv+t-We-^-^dk (6) 
o 

v„(0 = V(o,(£>,t) + V(a.aj) = u(l) + u'(t) 
25 where u'(t) is the complex conjugate of it(t) and we have 
R^I(ct.o>,/)] = 4v < ,(0 (7) 
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From Eq. (6) and Eq* (7) it is seen that filtering the signal v,(/) by 
a filter with the impulse response /?(a,co,/) with 2 complex poles 
will represent the reel part of the short time Z,(o\co,/) transform. 

5 If we let v,(f) be equal to the impulse response of a single pole we 
have 



«(/) = jke-^^'-^e'^+^dk 



o 



10 



= i C e- (a "*"*" ) ' je^o+'^e-^+J^dx (8) 



(a -a 0 ) + y'((D -©„) 



15 



and from Eq. (7) we have 

20 v (0 = 2k{a ~ CT(>)(g ~°' cos (°>0- e ~ g " f cos(oy)) 

(a -a 0 ) 2 +((D -<o 0 ) 2 

2k(a> -a> 0 )(e~°' sin(coQ - e' q,|f sin(tiy)) 
(a -o 0 ) 2 +(a> -o> 0 ) 2 

v„(0 -c 0 )cos(co 0 /)-((o -(o 0 )sin(co 0 /)) 

2£ (a -cr 0 ) : +(a> -co 0 ) J 



or 



(9a) 



25 
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-e" q/ ((g -q 0 )cos(G)/)-(co -03 n )sin(a>/)) 
(a -a 0 ) 2 +(<o -to 0 ) : 



(9b) 



5 Eq. (9) is not defined for (a ? a> ) = (a 0 ,co 0 ) but from (8) we have in 
this case 



and we have v o (t)—>0 for / -» oo . 

Eq. (9) shows that the gain is inversely related to a-a 0 and 

20 c>-co 0 , and when (a 0 ,co 0 ) is far from (g,cd) and e"°' - e'° lit is small, 

v o(0 fe 0- For (a 0 ,co 0 ) (a,co) v 0 (/) will have Eq. (10) as the limit, 
is not immediately to see if Eq. (9) has the maximum energy for 

(a 0 .G> 0 )<-(CT,G>) • 

25 In the DC domain Eq. (9) can be written as 




o 



and 



v 0 (/) = 2Ate* <v cos(© 0 /) 



(10) 



= 2k 




(id 



a -g 0 



30 The maximum for v (i (/) can be found as follows 
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—[a*""' -a n e^'] = 0 



when 



log(a)-log(a n ) 

5 , = ( 12 ) 



and Eq.(ll) will have the maximum for this value. 



It can be shown that t m — » ^ when a — >a 0 . 

10 

When a ~ a 0 we will have the approximated maximum with / = 



v,(^) = 2A l (13) 
a -a 0 

15 

From Eq. (13) it can be shown that 



2ke- ] 

v o — » for a— >c 0 

20 

In Eq. (11) e' av ' represent the signal to be analysed and e" 51 the 
filter. Table 1 shows the result with a filter having 
a = 100 s" 1 and the signal varying from 1 to 10000 s~ : 

25 It is not surprising that the convolution acts as a low-pass 

filter. The important fact is that the exponential function in the 
DC domain in some way acts as frequencies do in the frequency 
domain. 
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In table 1 v„,(/ /H ) is the result of a convolution where the signal 
is differentiated. The result is, as expected, a high-pass filter. 

If we look on Eq. (9a) without exponential functions it can be 
written as 



2&(sin(co0 - sin(co 0 /)) 



it is seen that for o — > oo we will have v tt — > 0 . 



a : 


100 s - 








>,„ 


v„(>,„) 




s' x 


s 






1 


0, 046516871 


0, 954548457 


0, 009545485 


10 


0 , 025584279 


0, 774263683 


0, 077426368 


100 


0, 010000000 


0, 367879441 


0, 367879441 


1000 


0, 002558428 


0, 077426368 


0 , 774263683 


10000 


0 , 000465169 


0, 009545485 


0 , 954548457 



Table 1 v„(/ m )is given by Eq.dl, 12) and 

normalised by a and 2k. v o] {f w )is a 
convolution where the signal is 
differentiated and normalised by 2k. 

For 0) «co 0 we will have 



2£(sin(a>/) - sin(co n /)) 
v\ £ — — (15) 



It can be shown that for Q — » go 0 we will have 
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v n (t) — ► 2facos(co 0 0 



(16) 



5 This result is as expected unstable. 

In transient analysis only the beginning of the signal is of 
interest, and if co 0 » 1 Eq.(14) will act as a band-pass filter. 

10 Speech processing is based on fast energy pulse generated by the 
vocal cords or by friction in the articulation channel weighted by 
the impulse response in the articulation channel. The rise time for 
the excitation pulses has to be sufficient faster than the rise 
time of the energy of the impulse response. 



The shape of energy pulses are important features in speech. If the 
time between the pulses is periodical it is voiced speech, and if 
not it is unvoiced speech. For some phonemes abrupt changes in the 
energy pulses are important. 



From WO 94/25958 it is known that the shape of the energy pulses 
are important for speech recognition, especially the leading edge. 
In the following a method to extract features will be developed 
based on an envelope detection. 



The convolution expressed in Eg. (9) can be regarded as a response 
from 2 poles in the articulation channel excited by an impulse. If 

a 0 s=a we have from Eg. (9a) 



20 



25 



30 




-a/ 



(17) 
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<?(/) = v«* (/) + ""(/) 

5 where 

«(/) = w(0*— 
Ttt 

is the Hilbert Transform. 
10 The envelope of Eq. (17) is then 



e"" 



e „(0 = i r -J (sin(to/ ) - sin(o> 0 /)) 2 + (-cos(coz) + cos(co 0 /)) 2 

|co -co 0 | 



15 1^/2(1 -cos(o -<o 0 )/) 



r(l -?cos((co -ffi 0 )0) (18) 



20 



CO — to Q | 



The approximation is legal because cos((to -co 0 )0] ^ 1 



As expected the envelope has a component with the difference 
frequency of the 2 frequencies . 

25 

The conclusion is that we can expect to find damped difference 
frequencies in the envelope cf the transient component. 



SUBSTITUTE SHEET (RULE 26) 



WO 99/48085 PCT/DK99/00128 

22 

To detect the damped difference frequencies a filter bank is used. 
The features might be detected as a convolution between the 
transient pulse and the impulse response of the filters. 



5 In general form the impulse response can be written as 

/?(/) = ke' kt sin(/a)'+40 
Where a = X and co = f(X) . 

10 

In the following analysis f(X) = \5X , k = co = \5X , and = 0 are 
selected and we have 



15 h(t) = \5Xe' Xt sin(L5X/) (19) 

By selecting o = L5a Eq. (19) will act as a band-pass filter with a 
low Q in relation to the frequencies. Other ratios ©/a than 1.5 may 
be selected and it is presently preferred that the ratio (co/a) 
20 ranges from 0.5 to 2.5. The exponential function gives the advance 
that it acts like natural time window that ensure that the signal 
is natural damped. The value of the parameters are selected by 
studying rise times in important transient pulses and by 
experiments . 

25 

Fig. 6 shows transient characteristics in speech signals. The top 
figure shows 50 ms of an "a" in "hard key" pronounced by a female. 

The second signal is a band-pass filtration of the speech signal. 
30 The band-pass filter is a Butterworth filter with 6 poles and a 
band width from 2150 to 3550 Hz. This frequency band contains 
important transient pulses in the sensitive frequency interval of 
the ear. 
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The third signal is a energy detection of the transient 
characteristics of the band-pass filtered speech signal. The 
detection is an envelope detection performed by means of a 
rectification and a low-pass filtration of the signal. The filter 
5 is a Butterworth filter with 3 poles and a cut-off frequency at 700 
Hz. 

In WO 97/09712 a method for automatically detecting the leading 
edges is disclosed. The method uses the maximum slope of the 
10 leading edge as reference, and the point before the maximum slope 
where the slope is less than a given threshold (10-20 % of the 
maximum slope) the leading edge is defined to begin. 

The transient (envelope) signal in Fig. (6) has a DC component, 
15 which does not contain any information. Therefore it is preferred 
that the signal is differentiated before it is analysed e.g. by the 
filter bank shown in Fig. 13. 

In Fig. 13, the filters (h^t), h 2 (t),..., h n (t)) in the filter bank 
20 connected between the input and the envelope detectors are band- 
pass filters having bandwidths corresponding to the bandwidths of 
the band-pass filters of the cochlea and having centre frequencies 
ranging from 1400 Hz to 6500 Hz. 

25 The output signals o 13 (p) from the filter bank shown in Fig. 13 is 
calculated by: 



i = 0, l 



r *■ 



...,N-1 



30 



3 = 0,1 M-l 



Mp) = o, 



p < 0 




p=0 , 1 



P-l 
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m=0, 1,.-, M-l and M is the number of band-pass filters with a low Q 
in the filter bank connected between the outputs and the envelope 
5 detectors, p = 0,1,...,?-! is the sample number, t' is the 

differentiated transient signal, and X tii is the filter bank 
parameter and it is normalised by the sampling frequency. 

In the analysis M is selected to 10 and 1500 < )J m < 12000 s~\ X' m is 

10 not normalised. By this we have 1885 <co w < 18850 s" 1 or 300 < f m < 3000 
Hz. 

This filtering process is not done in the cochlea but in the hair 
cells or in the nerve system behind the hair cells. 

15 

The Figs. 7, 8 , 9, 10, 11, and 12 show the output of the 
processing of transient signals in the vowels "a" , tt o" , tt i" in 
w hard key" and "soft key" pronounced by a female and a male. 
Further the figures show plots of maxima of the output signals as 
20 function of the time constant of the corresponding filter. 

The figures show that maximum curves are very much alike for the 
same vowels , independent of whether a female or male pronounces it 

25 With a library of templates and a distance measure it is possible 
to identify the sound picture, and it can be used for speech 
recognition and narrow band communication. 

Thus, according to the invention a method and an apparatus are 
30 provided for determination of a parameter of a system generating a 
signal containing information about the parameter, in which the 
signal is short time transformed substantially in accordance with 
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0 

in which v x is the signal, L is the transformed signal, a is a time 
constant, co is an angular frequency, and is a phase, or, in 
accordance with another transformation which will give rise to an 
5 L' (a,co,t) which in time intervals within which L(a,a),t) is larger 
than 10% of its maximum value is not more than 50% different from 
the result given by the short time Laplace transformation. 

In narrow band communication the transient pulses have to be 
10 identified and coded, and the decoder will contain a library of 

filters with corresponding transient responses. The decoder library 
could also contain the transient responses. 

The present invention also relates to measurement of mechanical 
15 vibrations e.g. when testing devices that generate mechanical 
energy during operation, such as mechanical devices with moving 
parts, such as compressors for refrigerators, electric motors, 
household machines, electric razors, combustion engines, etc, etc. 

20 For example, it is known that measurement of vibration generated or 
sound emitted by a device during operation can be useful for 
detection of malfunction of the device. Certain failures may 
generate sound or vibration of specific characteristics that can be 
recognised. 

25 

The method may also comprise steps of classification for 
classifying a tested device m accordance with the determined 
parameters into one class of a set of predefined classes. Each 
predefined class may be defined by a set of upper and lower limits 
30 for specific parameters determined according to the method. A 
device may then be classified as belonging to a certain class if 
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its corresponding parameter values lie within corresponding upper 
and lower limits of the class. 

Each class may correspond to a specific type of failure of the 
5 device. For example, shaft imbalance, wheel imbalance, crookedness, 
imperfections of teeth in cogs, tight bearing, loose bearings, etc, 
may cause the device to vibrate in different characteristic ways, 
whereby a characteristic mechanical vibration or sound is generated 
for each type of failure. The type of failure of the device may 
10 then be detected by comparing determined device parameters with 
corresponding parameter values of various predetermined classes. 

The upper and lower limits of a specific class of devices may be 
determined by testing a set of devices known to belong to that 
15 class. For example, the upper limits may be determined as the 

average of specific parameter values plus three times the standard 
deviation. Likewise, the lower limits may be determined as the 
average of parameter values minus three times the standard 
deviation. 
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CLAIMS 

1* A method for determination of a parameter of a system generating 
a signal containing information about the parameter, comprising the 
5 step of short time transforming the signal substantially in 
accordance with 



in which v 2 is the signal, L is the transformed signal, a is a time 
10 constant, co is an angular frequency, and cp is a phase. 

2. A method according to claim 1, wherein the step of transforming 
comprises filtering the signal v A with a filter having a pole at a 
+ jcot and a pole at a - jot. 

15 

3. A method according to claim 1 or 2 , comprising steps of 
transforming the signal v 1 for a plurality of sets of u and co 
values . 

20 4, A method according to any of the preceding claims, further 
comprising the step of determining a maximum of at least one 
transformed signal L(o,o>,t). 

5. A method according to any of the preceding claims, further 
25 comprising the step of comparing transformed signals L with 

corresponding reference signals in order to determine parameters of 
the system. 

6. A method according to any of the preceding claims, further 

30 comprising a step of pre-processing the signal before the step of 
short time transforming, the pre-processing being selected from the 
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group consisting of filtering, rectification, differentiation, 
integration, and amplification. 

7. A method of transmitting a signal containing information of a 
5 set of parameters of a system generating the signal, comprising 
processing the signal according to any of the preceding claims and 
further comprising the step of transmitting the determined 
parameter values. 

10 8 . A method according to claim 7 further comprising the step of 
generating a copy of the signal from the transmitted parameter 
values . 

9. A method of transmitting a signal containing information of a 
15 set of parameters of a system generating the signal, comprising 

processing the signal according to any of the preceding claims and 
further comprising the steps of 

comparing the signal with a library of signals generated for a 
20 predetermined set of parameter values by the system, 

selecting the library function that constitutes the best match to 
the signal, and 

25 transmitting an identification signal that identifies the matching 
library function. 

10. A method *according to claim 9, further comprising the steps of 
receiving the identification signal and generating the 

30 corresponding library signal. 

11. A method of classifying a system according to one or more 
parameters of the system generating a signal containing information 
about the one or more parameters, comprising determining the one or 

35 more parameters according to any of claims 1-6 and further 

comprising the step of classifying the system in accordance with 
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the one or more determined parameters into one class of a set of 
predefined classes defined by predetermined ranges of values of the 
parameters . 

5 12 . A method for communicating an auditory signal, comprising 

processing the signal by the method according to any of claims 1-6, 
transmitting the processed signal, and receiving the processed 
signal by a receiver. 

10 13 . A method according to claim 12, wherein, prior to transmission 
of the processed signal, the signal is coded into a digital 
representation, and the coded signal is decoded in the receiver so 
as to reestablish transient pulse shapes perceived by an animal ear 
such as a human ear as representing the distinct sound pictures of 

15 the auditory signal. 

14 • A method according to claim 13, wherein the digital 
transmission is performed at a bandwidth of at the most 4000 bits 
per second. 

20 

15. A method according to claim 14, wherein the bandwidth is at the 
most 2000 bits per second. 

16. A method according to claim 15, wherein the bandwidth is in the 
25 interval of 800-2000 bits per second. 

17. A method according to any of claims 13-16, wherein a second and 
further pulses in a sequence of identical pulses are represented by 
a digital value indicating repetition. 

30 

18. A method according to any of claims 1-6, comprising filtering 
the signal v. m a filter bank comprising a plurality of band-pass 
filters interconnected in parallel with centre frequencies ranging 
from 1400 Hz to 6500 Hz, each of which is connected in series with 

35 an envelope detector and a filter bank comprising a plurality of 
low-pass filters interconnected in parallel and having cut-off 
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frequencies ranging from 300 Hz to 3000 Hz and time constants 
ranging from 1500 s" 1 to 12000 s* 1 . 

19. An apparatus for determination of a parameter of a system 
5 generating a signal containing information about the parameter, 
comprising a processor that is adapted to short time transform the 
signal substantially in accordance with 



10 in which v x is the signal, L is the transformed signal, a is a time 
constant, co is an angular frequency, and cp is a phase. 

20. An apparatus according to claim 19, wherein the processor 
comprises a filter for filtering the signal Vi and having a pole at 

15 c + jcot and a pole at 0 - jcot. 

21. An apparatus according to claim 19 or 20, wherein the processor 
comprises a plurality of filters for filtering the signal v x , each 
filter having a different set of a and co values. 



22. An apparatus according to claim 19, wherein the apparatus 
comprises a communication channel transmitter, and the processor is 
adapted to determine the one or several parameters of the system, 



to transmit the one or several system parameters over a wireless or 
a cable communication channel. 
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3. Order, LP, 700 Hz, Butterworth 



Step frequency response. 
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Speech signal 




Transient isolation in the speech signal, band width 2150-3550 Hz 




Energy detection of the transient pulses by means of envelop detection, 
rectified and low pass filtered at 700 Hz 



Fig. 6 
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Frank ..Illda U _ T.FONHARD 


INVENTORIES I GNaMuRB^U jj f 


'DATE , 


RESIDENCE (City. Slate & Country) ^ j £ \, 

Louisevei 13 r DK-2800 Lynqby, Denmark 


CITIZENSHIP 1 

Denmark 




POST OFFICE ADDRESS (Complete Street Address including City. State & Country) 

Louisevej 13, DK-2800 Lyngby, Denmark 


GIVEN NAME FAMILY NAME 


INVENTOR'S SIGNATURE 


'DATE 


RESIDENCE {City, State & Country) 


CITIZENSHIP 


POST OFFICE ADDRESS (Complete Street Address including City. State & Country) 


GIVEN NAME FAMILY NAME 


INVENTOR'S SIGNATURE 


•DATE 


RESIDENCE (City, State & Country) 


CITIZENSHIP 


POST OFFICE ADDRESS (Complete Street Address including City State & Country} 


GIVEN NAME FAMILY NAME 


INVENTOR'S SIGNATURE 


•DATE 


RESIDENCE (City. State & Country) 


CITIZENSHIP 


POST OFFICE ADDRESS (Complete Street Address including City State & Country) 


GIVEN NAME FAMILY NAME 


rNVENTOR'S SIGNATURE 


•DATE 


RESIDENCE (City. State & Country) 


CITIZENSHIP 


POST OFFICE ADDRESS (Complete Street Address including City, State & Country) 



